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Abstract Adaptive Radio Resource Allocation is essential for guaranteeing high band- 
width and power utilization as well as satisfying heterogeneous Quality-of-Service re- 
quests regarding next generation broadband multicarrier wireless access networks like 
LTE and Mobile WiMAX. A downlink OFDMA single-cell scenario is considered where 
heterogeneous Constant-Bit-Rate and Best-Effort QoS profiles coexist and the power 
is uniformly spread over the system bandwidth utilizing a Uniform Power Loading 
(UPL) scenario. We express this particular QoS provision scenario in mathematical 
terms, as a variation of the well-known generalized assignment problem answered in 
the combinatorial optimization field. Based on this concept, we propose two heuristic 
search algorithms for dynamically allocating subchannels to the competing QoS classes 
and users which are executed under polynomially-bounded cost. We also propose an 
Integer Linear Programming model for optimally solving and acquiring a performance 
upper bound for the same problem at reasonable yet high execution times. Through 
extensive simulation results we show that the proposed algorithms exhibit high close- 
to-optimal performance, thus comprising attractive candidates for implementation in 
^ , modern OFDMA-based systems. 
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1 Introduction 

Radio Resource Allocation (RRA) mechanisms are expected to play a key role in 
emerging and future OFDMA-based multiuser wireless access networks. RRA aims 
at simultaneously guaranteeing high utilization of the available system resources, sat- 
isfying the individual Quality-of-Service (QoS) requirements of the competing users, 
and maximizing overall system performance. In order to accomplish these targets, an 
RRA or Frequency-Domain Packet Scheduling (under the LTE terminology) technique 
exploits the differentiated channel conditions experienced by the various users over the 
available bandwidth. In particular, a complete allocation decision comprises the specific 
set of OFDM subchannels assigned to each user as well as the transmission format, 
namely the amount of power and the modulation mode, for each resource block pp. 

1.1 Background and Related Work 

Adaptive RRA aims at positively exploiting the rate differentiation that occurs on two 
levels closely related to respective physical phenomena: the single-user centric variation 
of the achieved rate on a subchannel basis and the multi-user centric differentiation 
of the rate achieved by each user on each subchannel. The former phenomenon is 
related to the inherent frequency selectivity of the wideband wireless medium, while 
the latter to the so-called multi-user diversity caused by the statistical independence of 
the corresponding subchannels. Therefore, by assigning to each user the subchannels 
that experience favorable channel (and thus rate) conditions, we expect to significantly 
improve system performance. 

In [2] the above concepts were thoroughly presented first for single- and then for 
multi-antenna wireless mobile systems. According to this concept each receiver mon- 
itors the experienced SNR levels, feeds them back to the BS, while the BS schedules 
transmissions and adapts users' bit rates depending on the particular channel qual- 
ity reports. Similar arguments were raised in [3] where it was demonstrated that for 
2G/3G systems the cellular spectral efficiency may significantly improve, even double 
at certain conditions, when the BS utilizes the per- user channel/rate information. In 
[4] the ideas were extended to multi-channel OFDM wireless systems like the one we 
examine in this paper. The widely used term "opportunistic" bears a strong relation 
with our work, since we tend to allocate subchannels with high channel conditions (near 
to their peak) to the corresponding users (frequency-domain opportunism), while in 
[5] a similar policy is employed in the time-domain. 

The works of [5] and [6] were the first to introduce an optimization framework for 
handling multi-user OFDM resource allocation problems, paving the way for an exten- 
sive utilization of concepts and methods addressed in the engineering optimization field, 
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towards efficiently assigning system resources to active users. The original formulations 
of the respective system power minimization [5] and sum-throughput maximization [6] 
problems indicated the hard large-scale non-linear integer nature of the underlying 
RRA decision problems, rendering the straightforward discovery of the optimal alloca- 
tion practically impossible. Towards relaxing the complexity of the related problems, 
a plethora of simplifying approaches has been proposed in the literature aiming at the 
development of suboptimal, yet efficient and computationally tractable allocation rou- 
tines [?1|8]. The works in [9] and [10] provide extensive surveys on the methodological 
and algorithmic aspects of the particular research area. A popular assumption which 
was first proposed in [11] and further elaborated in [12] and [13] regarded the uniform 
power loading (UPL) of the system subchannels. Under a known power distribution, 
the achieved bit-rates for all the possible subchannel/user combinations can be pre- 
calculated, and thus the original multidimensional RRA problems resort to exclusive 
subchannel allocation problems. From a system perspective, OFDM A is considered the 
major transmission and multiple access technology for modern wireless networks such 
as 3GPP-LTE [14], Mobile WiMAX [15] and also plays a key role in the new paradigm 
of Cognitive Radio Networking [16] . In particular, LTE-oriented works may be found 
in [T7i 18 , WiMax-oriented ones in [19.ll20.U2l] and cognitive networking related in |22l 
[23] correspondingly. 

1.2 Contributions and Novelty 

In this paper we examine a single-cell downlink OFDMA system scenario supporting 
multiple QoS profiles under the UPL assumption. Unlike single-profile studies, which 
have attracted enormous interest in the related literature, the heterogeneous problem, 
which is more interesting and realistic has not been given great attention. In [21] a 
mixed CBR/VBR scenario was introduced for the single-user case, while in our work 
we consider multiple users demanding mixed services. The multi-user case was studied 
in [25II26| where suboptimal allocation algorithms were proposed. However the UPL 
hypothesis have not been taken into account as in our work and the efficiency of 
the algorithms compared to the maximum achieved performance (that is, the optimal 
one) was not demonstrated. Finally, in [12] an efficient UPL-based resource allocation 
algorithm was proposed guaranteeing a set of minimum bit rates, while in our paper a 
more realistic QoS scenario is assumed, comprising mixed CBR and Best-Effort traffic 
profiles. 

In particular, in this work we employ two optimization approaches to our problem. 
The former is based on Integer Linear Programming (ILP) and allows us to find: (a) 
the exact optimal resource allocation decision as well as the maximum sum-rate per- 
formance in reasonable, yet high execution time, and (b) a performance upper bound 
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in polynomial time solving the LP-relaxed version of the IP formulation. The second 
approach is based on our observation that the specific heterogeneous formulation bears 
similarities with a well-known combinatorial optimization problem, the Generalized 
Assignment Problem (GAP) [27] . In order to efficiently solve the GAP formulation we 
devise two heuristic schemes, which are executed with polynomially bounded computa- 
tional cost. The development is mainly inspired by the ideas utilized in the respective 
GAP heuristics found in the related literature. The examined schemes are tested in 
an extensive set of realistic computational-based network scenarios, in terms of the 
achieved sum-rate performance. The simulation results show that the devised heuristic 
schemes: (i) outperform drastically a semi-random subchannel assignment approach, 
and (ii) perform close to the exact optimal allocation, while their complexity cost is 
significantly lower. 

1.3 Paper Structure 

The remaining of the paper is structured as follows. In Sec. [2] we present the adopted 
system model and the holding assumptions, state the multiple-QoS profile RRA prob- 
lem and formulate it as a Binary Integer Linear Programming optimization problem. 
In Sec. |3] we discuss how we can express our problem as a variation of the GAP com- 
binatorial format, and which are the advantages of this approach, while in Sec. [4] we 
devise two heuristic allocation algorithms inspired by the GAP concept and discuss 
computational complexity issues. In Sec. [S] we present the results of the computational 
experiments and comment on them, while in Sec.[6]we summarize our work and propose 
possible extensions of it. 

2 System Model and Mathematical Formulation 

2.1 System Description and Assumptions 

The downlink (DL) of an OFDMA cell is considered in the context of this work, where 
a Base Station (BS) located at the center of a 2D area is fully responsible for allocat- 
ing system subchannels (or equivalently physical resource blocks according to the LTE 
terminology) to the competing users. The available bandwidth comprises TV mutually- 
orthogonal subchannels (or resource blocks) forming the set S = {1, 2, . . . , n, . . . TV}. 
The available transmission power Pf, s is uniformly spread over the bandwidth, namely, 
each subchannel n is loaded with an equal amount of power given by P n = P^g/N. 
We assume that K users are present in the cell, forming the corresponding set U — 
{1, 2, . . . , k, . . . K}, and that each user is assigned one of the two available QoS pro- 
files/classes, the Constant-Bit-Rate (CBR) or the Best-Effort (BE) one. Users belong- 
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ing to the CBR-class subset (Wcbr) demand a specific constant service rate denoted 
by 7?™ m (Vfc £ Wcbr)> representing Voice/ Video- like services. On the other hand, users 
belonging to the BE-class subset (Wbe) have infinitely backlogged queues and no strict 
QoS guarantee, reflecting FTP-downloading services. 

In order to perform adaptive resource allocation in the above typical multi-user 
OFDMA system setup the following key operations/assumptions are supported: 

— Each user estimates the BS-user link channel quality over the available subchannels 
and feeds back this information to the BS through an error-free and delay-less 
uplink signaling channel. 

— Due to the wideband and multi-user nature of the transmission medium [28] each 
user k achieves different rate performance over each subchannel n, given by r n % 
bits/symbol. Given that the BS exactly knows the wideband and multi-user channel 
response as well as the allocated power per subchannel P n , the r n /. bit rates are 
pre-calculated (please refer to the discussion in Appedix lA")) . 

— Based on the above 2D rate matrix and the QoS targets, the BS decides the exact 
subchannel set assignments for each user. This information is also transmitted to 
each user in order for the useful data to be correctly decoded. 

— Each subchannel should be allocated to a single user, thus avoiding intra-cell inter- 
user interference pQ. 

2.2 Mathematical Formulation and Optimal Solution 

We introduce N • K binary integer variables notated by p n /. where p n = 1 if the 
n th arbitrary subchannel is allocated to the k th arbitrary user, else equals zero. The 
data-rate r n k supported on each subchannel/user combination can be pre-calculated 
based on the abstraction modeling function given in Appedix(X] We are now able to 
formally define the RRA problem, which constitutes the identification of the allocation 
decision that maximizes the cell sum-rate (Rtot), satisfies the individual data-rate 
constraints for the CBR-class users and preserves subchannels orthogonality. Eq. (|la|l 
express the system- wise objective function, Eq. (|lb|l the K\ (|WcbrI) Q°S constraints, 
and Eq. (|lc|) the N subchannel-sharing-avoidance system constraints. Power availability 
and minimum BER constraints are handled implicitly as explained in Appendix [X] 
Although the generalized bandwidth and power allocation problem is non-linear [5] , the 
UPL hypothesis allows us to recast RRA in a linear form, as shown in Eq.JTJ. Actually, 
due to the exclusive assignment of each subchannel to one user, the UPL variation of 
RRA is actually an Integer Linear Programming (ILP) optimization problem. In fact 
we are dealing with an ILP problem, including N • K binary variables and Wcbr + N 
equality constraints, where N ~ 100 and K ~ 10—20 under a typical network setup [14] . 
Despite the fact that ILP problems are still hard to solve due to their NP-hard nature, 
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efficient methods and solvers like CPLEX, SCIP or GUROBI are available and may be 
employed for problems of such scale. 

Finally, an upper bound on the system performance could be estimated by relaxing 
the integrality constraints or equivalently allowing for one or more subchannels to be 
shared among users. For this case an LP solver could solve the specific problem with 
3rd order polynomial complexity cost. Note however that the LP relaxation approach 
is able to provide us only with the system performance (in fact the theoretical achieved 
upper bound) and not with an implementable allocation decision due to the violation 
of the subchannel-sharing constraints set. We have to emphasize the implementation 
limitations of both IP and LP approaches, underlying that their value is mostly theo- 
retical. For this reason we proceed with developing simpler allocation schemes in the 
following two sections. 

maxJ? tot = ^2 ^ r n,k ■ Pn,k = ^min + r n,k ' Pn,k (la) 

n£S k£U k£UcBK n£S fcet/sE 

subject to 

r n ,k ■ Pn,k = -Rmim Vfc £ W C BR (lb) 

^2p ntk = l,\/neS,pel NxK ,I={0,l} (lc) 



3 Radio Resource Allocation as a Variation of the Generalized Assignment 
Problem 

The Generalized Assignment Problem (GAP) along with its variations is a well-studied 
special formulation of Combinatorial Optimization. Although GAP is still an NP-hard 
problem, several efficient approximate heuristic algorithms have been proposed for 
solving it 27,29] due to its wide application in real- world problems. A Knapsack-based 
definition of GAP proposed in [27] constitutes the assignment of items to knapsacks 
in order to maximize the overall profit while not exceeding the capacity constraints of 
each knapsack. The assignment of each item to a knapsack incurs different profit and 
induces different cost. Note that an item can be obviously assigned to a single knapsack. 
A slightly different definition is given in [25] , where one searches for the best scheduling 
of tasks or jobs to agents, in order to minimize the overall processing cost and do not 
violate the available resource budget of each agent. Again a task cannot be split into 
multiple agents. Finally, a variation of the GAP, known as the Covering Assignment 
Problem (CAP) is given in 30 which applies to a dairy industry distribution scenario. 
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Specifically, CAP regards the optimal distribution procedure of the milk produced by 
several farms to a set of factories, in order to minimize the cost of processing and 
simultaneously satisfy the minimum resource demand of each factory. 

Towards expressing our RRA problem as a GAP, we first define the correspondence 
of an item/task/farm to a subchannel and a knapsack/agent/factory to a user of the 
system. RRA may then be seen as the optimal exclusive allocation of each subchan- 
nel to a single user in order to maximize the sum-rate (profit) function and provide 
minimum rate (resource) assignments for a user subset. We also have to mention that 
our formulation possesses several distinctive features compared to the aforementioned 
classical GAP/CAP approaches: 



— The constraint expressions are tight whereas in GAP and CAP they correspond to 
left-hand-side or right-hand-side inequalities 

— The constraint expressions are imposed on a subset of users and not over the 
complete users' set 

— the profit /weight/cost factors are equal {r n j.) and depend on both subchannel and 
user indexes through the experienced rate/channel conditions. 



As far as GAP and its variations, several efficient heuristic algorithms could be found in 
the literature. In [27] the authors proposed a scheme which was based on the construc- 
tion of an initial effective feasible solution and its subsequent improvement through 
item reallocations. In [31] the author proposed a "dual" algorithm, according to which 
the optimal solution was approached from the exterior of the feasible region. Finally, 
in |30| the authors devised similar schemes for the CAP variation. In the following sec- 
tion we devise two algorithms, inspired by the previous GAP/CAP works. We remark 
that due to the differences between our RRA problem and the classical formulations, 
none of the existing algorithms could be employed as-is. Moreover, although other so- 
lution approaches may be found in the literature, such as the Lagrangean Relaxation, 
Branch-and-Bound, Metaheuristics, etc. (see for example a survey in [32]) we focus on 
the heuristic solution approach since it is appealing from an implementation point of 
view. One should bear in mind that in a practical network scenario the complete re- 
source assignment must be updated every transmission frame. For example, regarding 
contemporary wireless access systems like LTE and WiMAX, the update rate should 
be of the order of 0.5-1 msec [14] . 
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4 Heuristics for Solving the Resource Allocation Problem 

4.1 Heuristic I: Approaching the Optimal Allocation from the Interior of the Feasible 
Space 

Description The first heuristic is mainly inspired by the Knapsack solution approach 
given in [27] . Resource Allocation is performed in 2 phases, that is the construction of 
an initial feasible solution and the search for a gradually improved allocation. Hereafter, 
when we refer to a "feasible" allocation decision, we will mean that the |Z-/cbrI data 
rate constraints are satisfied. As far as the remaining N orthogonality constraints, these 
will be implicitly met, since at each subchannel allocation step, the subchannel which 
is assigned to a user will be removed from the available bandwidth "pool". Hence, 
we should not confuse the use of the feasibility term with the one regarding the LP 
relaxation given in Sec[2] The complete algorithm including all the intermediate steps 
for each phase is given after the description of its key features. 

During the 1st phase, our primary objective is to construct a feasible RRA solution, 
that is, satisfy the strict data-rate constraints of all CBR users, while secondarily the 
respective allocation has to be as efficient as possible in terms of the overall system 
performance. Towards the 1st point we prioritize the CBR-class subchannel allocation 
procedure over BE-class, thus decoupling the inter-class problem to two consecutive 
intra-class sub-problems. Concerning the secondary target, we state that if the above 
QoS constraints are satisfied by utilizing the minimum amount of the available Tx 
power as well as the "best" subchannels in terms of the achieved channel/rate con- 
ditions, then plenty of resources will be available for BE-class users. Such a policy 
would enhance system performance, since BE users are the ones that contribute to 
the objective function, due to the predetermined static nature of the CBR-class users' 
data-rates. Hence, after the necessary initializations and declarations (Step 1), an it- 
erative joint subchannel/user allocation is applied over the CBR users subset ignoring 
BE users (Step 2). The selection criterion for picking the best CBR subchannel-user 
pair at each iteration is dual: on the one hand the user experiencing the minimum 
averaged achieved data-rate over the remaining available subchannels set is prioritized 
and on the other hand the "best" subchannel in terms of rate performance is promoted. 
The user-selection criterion prioritizes the users with the worse rate conditions, which 
are expected to consume the largest portion of resources, and by allocating to them 
their most efficient subchannels we succeed into constraining the overall amount of 
CBR-class resources consumption. 

Subsequently, we finalize the initial feasible RRA solution by assigning the unallo- 
cated resources to the BE users (Step 3). Since no QoS requirements are imposed for 
the particular class, the simple policy of best-rate user allocation is also the optimal 
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one (see also [TT]). Up to now, we have provided a feasible allocation decision, how- 
ever we argue that we could further improve it since the previous steps may induce 
suboptimality to the sum-rate performance. Note that this is caused by the "greedy" 
(local) nature of the heuristic allocations during Step 2. Towards the above purpose we 
employ a series of subchannels swaps (Step 4). Specifically, we perform a subchannel 
interchange between 2 users if and only if the system performance is enhanced and 
simultaneously the feasibility for any CBR user is not violated. Due to the presence of 
multiple QoS classes, we have to discriminate between 2 2 = 4 possible swap scenarios (2 
classes are considered in this work) . Finally in order to limit the number of interchanges 
(and thus the execution complexity) we perform only one round of comparisons and 
possible reallocations by a single sweep of the set of system users. We remark that by 
increasing the comparison rounds, a marginal additional performance gain is expected, 
since already at each user iteration the owner's subchannels are compared with the 
subchannels of all the other users. 

The last subprocedure given in Step 5 comprises the release of possible redundant 
subchannels of CBR-class users due to the active nature of the related constraints and 
the reallocation of them to a BE user. We put an emphasis on the fact that for CBR 
users surplus allocated data-rates are ignored. 



Complete Algorithm, 



Step 1 Initializations-Declarations 

Let { r n,fc} ne5 the known achieved data-rates for all subchannel/user combina- 
tions and {p n , k \ nl - s the set of the allocation variables. 

N 

Define: lhs k = £ r n>k ■ p rhk and b k = i?„ lin ,Vfc £ W C br- 



n=l 



Step 2 Subchannel Assignment to the CBR-class users - Feasible Allocation 

Define the available subchannels pool Af and the subset of the unsatisfied CBR users 
/C as: 

JV= jneS: J2 p n<k =o\,lC= {k e Wcbr : lhs k - b k < 0}. 

N 

Update lhs k = £ r„ jfc • p n ^ k and b k = Vfc G W C BR- 

n=l 

Pick the subchannel/user combination according to: 

k* = argmirJ (l/\J\f\) ■ J2 r n.k \ ,«* = arg max {r„ fc . } and 
fce/C I neAf J n£Af 

Perform the subchannel allocation: p n » k * = 1 

If K, 7^ Repeat Step 2 else Break the Loop and Go To Step 3 
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Step 3 Subchannel Allocation to BE-class users - Finalization of 'the Feasible Solution 

Assign each unallocated subchannel to the "best" users in terms of the achieved rate: 
For each n G A/", set p n t» = 1, where fc* — arg max |r„ k \ 

k£U B E ' 

Step 4 Subchannel Swapping ~ Improved Solution 

For every user fc G U (either CBR or BE) repeat the following procedure serially: 

(a) Find his/her allocated subchannels set: Ij. = {n 6 5 : Pn,k — l} 

(b) Find the complementary users/subchan. sets : U' k = U\ {k} ,X' k = S\L k 

(c) For all possible combinations ((n,k), (n',k'^/, where n G G l' k ,k G U' k 
check the inner conditional expression depending on the active scenario: 

(c.l) If k G Uqbr and k! G Ucbr- »V,jfc > r n k && Ihsy + 

' ?i,k' ^'n' .k' & °k' 
or r n,k' > T n'M' && lhs k + r n> ,k ~ r n,k 3 s b k 

(c.2) If fc G W B e and k' G W B e: r n / )fc - r n;k + r nM - r n , yk , > 
(c.3) If k G Ucbr and fc' G W B e: »"„,&' > r n ',fc' ^ s fc + 
(c.4) If & € W B E and fc' 6 U C BR- ?V )fe > r n /c lhs k > + 

If any of the conditions is TRUE employ the corresponding resource swapping: 

Pn,k = 0,Pn',k' = 0,Pn',k = 1, Pn,k< = 1- 

Step 5 Release of CBR-class Subchannels for BE-class 
For every CBR user fc G Wcbr : 

(a) Find his/her allocated subchannels set: T k = {n G S : p nk = l} 

(b) For each n G T k if Z/isj, — r n k ^ b k release it from this user and allocated it to the 
best BE-user: p nk = 0, p n . k * = 1, where fc* = arg max {r nyk } 

4.2 Heuristic II: A Dual Approach of the Optimal Allocation 

Description An alternative heuristic algorithm inspired by the works of [31] and |30| . 
where the optimal allocation decision is approached from the exterior of the feasible 
space, is proposed in the present subsection. The "dual" approach shares also several 
similarities with the Lagrangean Relaxation technique, which dualizes the hard con- 
straints. Specifically, the solution is approached on 2 phases: the construction of the 
optimal unconstrained subchannel allocation and the feasibility transformation of the 
initial decision through a series of subchannel reallocations. All the intermediate alloca- 
tions generated by the dual approach are infeasible except for the last one. Note, that a 
similar algorithm for a single-QoS profile RRA problem has been proposed in [12j . We 
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now provide a summary of the algorithm as well as a complete step-by-step description 
of the dual approach in the context of our RRA problem. 

During the 1st phase, the necessary initializations and declarations (Step 1) as 
well as the optimal unconstrained subchannel allocation are performed (Step 2). The 
optimal solution to the corresponding unconstrained problem is extracted by simply 
allocating each subchannel to the user that experiences the maximum data-rate on 
it. Obviously, the sum-rate performance of the best-user allocation policy is an upper 
bound on any QoS-constrained scenario. The 2nd phase on the other hand comprises 
a series of subchannel reallocations aiming at satisfying the minimum QoS targets or 
equivalently rendering the solution feasible (Step 3JJ. The selection of user/subchannel 
pairs participating in each reallocation cycle is dictated by the efficiency metric shown 
in Eq.([2|, where n stands for a subchannel candidate for reassignment, k*(n) for the 
current owner of the subchannel (as determined by Step 2), and k the index of the 
candidate reassigned user. 



r n.k*(n) T n,k 

a n,k = ■ 



r n ,k 



(2) 



The nominator of the metric expresses the decrease in sum-rate performance due to the 
reallocation, whereas the denominator the increase of the the infeasible user data-rate 
or equivalently the decrease of the distance from feasibility. Thus, by minimizing this 
metric we simultaneously harm as less as possible the original objective function value 
and approach as fast as possible the feasibility region. Notice, however that a specific 
resource-block reallocation is possible if the minimum QoS target for the owner-user is 
not violated. The particular iterative procedure ends when all CBR users' demands are 
satisfied. Finally, the last step of Phase 2 (Step 4) constitutes the release of redundant 
subchannels (if any) from the CBR-class users, similarly to the procedure followed in 
the 1st Heuristic. 

Complete Algorithm 

Step 1 Initializations - Declarations 

Let {r n ,k} neStkeW {pn,k} neS ^ w {lhs k } keUc ^{b k } keUc ^ as in Heuristic I. 



Step 2 Optimum Unconstrained (QoS-unauiare) Allocation 

Best-Rate Allocation: For each n£S, set p n ^* = 1, where k* — argmax{r n ^} 

feG hi 

Minimum QoS satisfaction check: If for all k £ Wcbr : lhs k ^ b k Then Go To Step 4 
Else Go To Step 3. 

1 During the reallocation subprocedure followed here, subchannels are removed from one 
user and given to another, whereas in the 1st heuristic subchannels interchanges occur 
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Step 3 Subchannels Reallocation - QoS Satisfaction 
Define the following subchannels and users subsets: 

Jcbr = {fc G Wcbr ; lhs k > b k } C Wcbr : the subset of CBR users for which minimum 
QoS is met. 

•J'cbr = Wcbr\^7cbr : the complementary subset of undersatisfied CBR users. 

Jbe ~ {k £ Wbe : lhsk > 0} C Wbe : the subset of BE users allocated at least one 

subchannel or equivalently posses non-zero data rate. 

J = i7cbr U Jbe : the subset of users from which we can remove subchannels 
J 1 — J"'cbr : the subset of users to which we must add subchannels in order to meet 
the minimum QoS requirements. M = {n G S,k G J : p nk = l}: the subchannels pool 
from which we can extract subchannels and reallocate to the users subset J . 

Repeat the following procedure while there exist unsatisfied CBR users, namely as long 
as J' 0: 

(a) For each subchannel n£jV identify the owner k* (n) = {fc G J : p nk = 1|. 

(b) Compose the 2D Reallocations Array A\jy\ x \jr\ = [a„.fe] , where: 

( " } — > if lhs k'(n) ~ r n,k*(n) > b k'(n) an d r n ^ k > 

' n,k 

+oo , elsewhere 

(c) Check all the elements of the array: 

If for all elements a n k = +00, Break the Loop and Go To Step 4. 

Else pick the subch./user combination according to \n',k'\ = arg min A and 

1 ' n£\M\,kdJ' 

perform the reallocation: p n ^k*(ri) = OiPn'.fc' = 1- 

(d) Update the related quantities/subsets: J , j' ,M, lhs k , k £ Wcbr- 



^n,k 



Step 4 Redundant CBR-class Subchannels Reallocation 
Define the following subchannels and users subsets: 

J = {k G Wcbr : Ihsfc > b k }: the subset of CBR users assigned redundant data-rate 
(over-satisfied users). 

N = {n £ S, fc£ J: p„jt = 1 }: possible redundant subchannels pool. 

Repeat the following procedure while there exist over-satisfied users, namely as long 

as 

For each n G M: 

(a) find the owner k* (n) = {fc G J : p n J t = 1} 

(b) If Ihsy —r nk * ^ b k * remove the subchannel n from the owner k* (n) and reallocate 
it to the "best" BE user in terms of achieved data-rate k': 



13 



p n ' k*(n') ~ ®>Pn',k' = 1) where k' (n) = arg max {r n ^\, Else Go To the next 
subchannel. 

4.3 Computational Complexity Estimation 

An estimation of the computational complexity of the proposed optimal and suboptimal 
schemes follows. With respect to the heuristic algorithms we take into account the 
involved searching, sorting and comparison operations. We assume that the order of 
CBR and BE users is the same that is, |WcbrI ~ I^BeI ~ K and that the number of 
subchannels is an order greater than the number of active users, namely N 3> K. 

Exhaustive and ILP/LP models We first consider an exhaustive search approach, where 
all the possible combinations of subchannels-to-users assignments are examined. The 
particular procedure has an exponential complexity order of 0(K N ), which is obviously 
computationally intractable. As far as the exact optimal ILP model there is no guar- 
antee for the execution complexity, due to its NP-hard nature, however it is expected 
to be significantly lower than the complete enumeration. The relaxed LP model on the 
other hand has a provable third-order polynomial solution complexity with respect to 
the number of the involved variables. Hence, in order to find the performance upper 
bound of the RRA problem we have to spend a set of computational operations of 
order O (N 3 K 3 ). 

Heuristic I Since the introductory Step 1 induces no complexity, we proceed directly 
with the next step. We first employ a necessary pre-sorting operation regarding the 
achieved data-rate values over all users, which costs KN\ogN operations (this is an 
implementation issue, and this is why it was omitted from the previous algorithmic 
description). Regarding Step 2 we perform at most N iterations for satisfying the CBR- 
class QoS constraints, and at each iteration we have to select the worst user in terms of 
the average achieved data-rate with complexity cost K and locate its best subchannel 
with cost logN (due to pre-sorting), leading to a total cost of NK + N log N operations. 
Step 3 involves N log K operations for identifying the best-rate user for each remaining 
subchannel. Step 4 is the most complex one, since it entails a series of comparisons: 
without loss of accuracy we assume that each user is preallocated on average N/K 
subchannels, and then for each user the assigned N/K subchannels are compared with 

the NIK subchannels of the complementary (K — 1) users. All users are scanned for a 

N N 9 
full cycle, hence the whole process costs — • — • (K — 1) • K « N operations. Finally, 

Step 5 involves at most N subchannel releases, and for each one the best-rate BE user 

must be located with cost log if, hence NlogK operations are needed. Combining 

all the above algorithmic steps we get KN log N + NK + N log N + N 2 + N log K 
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operations and after performing several manipulations and approximations we resort 
to an estimated computational complexity order of O (iV 2 + NK log AT) . 

Heuristic II Similarly to the 1st heuristic we first employ a pre-sorting operation for 
the 2D rates array which costs KNlogN operations. Step 2 needs NlogK operations 
in order to find the optimal unconstrained allocation. Step 3 contributes significantly 
to the overall complexity: at most iV subchannel reallocations are performed and for 
each one a reallocation cost matrix containing NK elements must be devised and then 
searched for the combination experiencing the minimal cost. Thus N ■ NK = N 2 K 
operations are needed to fulfil the particular step. Step 4 induces an additional cost of 
TV log K operations as in the 1st heuristic. Accounting for all the algorithmic steps we 
result to an overall approximate complexity order of O (N 2 + NK log AT) . 

5 Simulation Results and Discussion 

Simulation Setup The DL of a single-cell OFDMA-based packet data network is mod- 
eled and simulated in the context of this work, while the selection of system parameters 
reflects an LTE scenario [14] . A system bandwidth of 20 MHz is assumed, consisting 
of N — 100 orthogonal data subchannels. DL transmissions occur on frame bursts of 
Tf = 1 msec. A realistic pedestrian NLOS macro-cell urban channel from the Win- 
nerll models family is adopted [33] . The PHY-abstraction function of Cioffi is utilized 
for associating each channel-to-noise ratio level with an achieved data rate (see Ap- 
pendix fSJ- Channel conditions are assumed perfectly known at the BS, allowing for 
an opportunistic subchannel allocation. At the receiver side transmissions are harmed 
due to thermal noise with power density of Nq = —174 dBm/Hz. 

As far as the traffic/QoS models, a CBR/BE dual-class scenario is formed. Each 
CBR user requires i?™ in = 36 bits/OFDMA symbol; the overall required CBR load 
is determined by varying the number of users, where |Wcbr — 6-12. The number 
of BE users is held constant at |Wbe| — 5. System capacity may be calculated as 
Cmax = A^ ■ Cmax = 600 bits, where Cmax is the maximum supported data-rate per 
subchannel, however the true achieved cell-rate is expected to be lower due to the hard 
CBR rate constraints and BE-class users' power shortage. The simulation parameters 
are summarized in Table [T] 

We consider five schemes which are compared below in terms of the achieved sum- 
rate performance while we guarantee the CBR-class rates: 

— The exact optimal scheme extracted by solving the ILP optimization problem given 
in in Eq. © (IP) 

2 For simplicity reasons we may assume that an OFDMA frame carries one data-symbol. 
Thus all the data-rate/throughput quantities may be identically expressed in bits/OFDMA 
symbol/frame or simply in bits. 
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Table 1 Simulation Parameters 



Quantity 


Symbol 


Value/Comment 


Carrier Frequency 


fc 


2.5 GHz 


System Bandwidth 


BW 


20 MHz 


Subchannel Bandwidth 


Af 


200 kHz 


Number of Subchannels 


N 


100 


Transmission Error Rate 


Pe 


10" 6 


Noise Power Density 


N 


-174 dBm/Hz 


Higher Xx Order per suchannel 




6 bits/symbol 


Cell Radius 


Rcell 


2 km 


Channel Model 




Winnerll C3 BU-NLOS Macro 


Users Distribution 




Uniform 


Users Mobility Model 




Pedestrian (4 km/hr) 


Number of OFDMA frames per drop 




100 


CBR QoS model 


f pmin 1 

\ n k JfcGWcBR 


36 bits/OFDMA symbol 


CBR users 


Ki = WcbrI 


6-12 


BE users 


K 2 = |Wbb] 


5 


Max Number of drops 


jyulira 
drops 


1000 


Min Number of drops 


\fllim 
drops 


25 


Statistics Convergence Threshold 


& norm 


0.02 



— A scheme that provides us with a performance upper bound, obtained by solving 
the continuous relaxed version of the previous scheme (LP) 

— The 1st heuristic proposed in Sec l4.ll (HEUR1), for which we also run a version 
without using the swapping sub-procedure (HEUR1 no swap) 

— The 2nd heuristic proposed in Sec[0] (HEUR2) 

— A semi-random allocation algorithm (RANDOM) which: (a) satisfies the target 
rates for the CBR-class users, by picking them one by one and assigning their best 
available channel until all requirements are met and, (b) assigns the remaining 
subchannels to the BE users randomly 

All the algorithms are implemented in MATLAB and for the ILP/LP problems we 
utilize the CPLEX solver [31], calling it through the TOMLAB interface [35]. In order 
to capture the effect of different system parameters to the achieved performance, we 
simulate 20 realistic system scenarios by varying: 

(i) the CBR-loading/ QoS levels in the service area, by considering a different number 
of active CBR users (K\ = |W C BR = 6,8,10,12) 

(ii) the average experienced SNR conditions in the cell, by tuning the ratio of the 
BS transmission power (Pf, s ) to the minimum required Tx power for guaranteeing 
feasibility regarding the K\ data-rate constraints (P^g^Kx)) in each scenario 
( P bs/P[*BR = 2.0, 2.5,..., 4.0 B 

3 Note that lower values for the power availability metric (e.g. 1.0 or 1.5) are not examined. 
This is justified by the fact that at lower SNR conditions, one or more CBR rate constraints 
can not be satisfied ("outage" conditions), rendering the QoS constrained problem infeasible. 



16 



We define an arbitrary realization or "drop" as a system setup consisting of a set of 
users randomly placed in the cell, for which their large-scale channel conditions are held 
constant, whilst the small-scale conditions vary in time. Each drop consists of 100 con- 
secutive OFDMA frames, spanning 100 ■ Ty = 100 msec. For each scenario we simulate 
multiple statistically independent drops and record the average sum-rate performance 
of each scheme. The simulation is terminated if the normalized (to the mean) variance 
of the performance statistics drops below the convergence target threshold (which is set 
to 0.02) or if the maximum number of simulated drops (which is set to 1000) has been 
reached [37]. Moreover, a minimum number of 25 drops are executed in any scenario, 
in order to avoid transient effects. Thus, at least 25 • 100 = 2,500 and at most 1000 ■ 100 
= 100,000 optimization problems are formulated and solved for each scenario. 



Results and Discussion In FigsfTJ and we depict the sum-rate performance for all 
schemes as a function of the required CBR loading and the average experienced SNR 
conditions. The effect of the subchannel swapping procedure to the performance of 



the HEUR1 algorithm is illustrated in Fig[3](HEURl with/no swap). In Figs 4(a) and 
|4(b)| we provide the optimality gaps of our heuristic schemes (HEUR1,HEUR2) for two 
representative power availability scenarios. Finally in Table [2] we present the achieved 
performance of the heuristic schemes compared to the exact optimal (IP) as well as 
the performance difference between the IP and the relaxed LP approach. The main 
observations/comments regarding the behavior of the various schemes/algorithms are 
the following. 

(i) Both heuristic approaches perform close to the optimal allocation scheme and fol- 
low its performance trend for different system conditions (Figs lll2l) while their 
complexity cost is significantly lower. 

(ii) The proposed heuristics outperform the RANDOM assignment scheme significantly. 
This means that exploiting the inherent 2D rate selectivity /diversity of the system 
leads to a dramatic increase on the cell performance. The performance gain of the 
1st heuristic is 60.6% on average. In particular in low SNR conditions the gain is 
approximately 80% and at higher SNR conditions drops to 48.64%, which is still 
high. This is reasonable, since at deteriorating channel conditions, the selection 
of the "best" subchannels-set for each user becomes more critical. On the other 
hand the 2nd heuristic provides us with an average gain of 52.8%, which is still 
remarkable. 

(iii) The optimality gap (defined as the percentage sum-rate loss from the optimal 
scheme) for each algorithm depends on the CBR-loading and the received SNR 



In order to cope with such situations, an adaptive power allocation strategy must be employed, 
like the one we proposed in our recently published work l.'SOI . 
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conditions experienced in the cell. The gap seems to narrow as SNR conditions are 
improved through the increase in the BS power. This is justified by the fact that for 
higher amounts of BS power, larger data-rates per subchannel are supported (close 
to the upper bound of 6 bits), leading to the vast majority of available bandwidth 
resources experiencing high performance, and thus the selection of the optimal 
subchannel set for each user is not so critical anymore. 

(iv) The 1st heuristic has an average optimality gap of 6.7% for the lowest BS-power 
scenario which decreases to 2.3% for the highest BS-power scenario. On the other 
hand the alternative dual heuristic performs worse than the first one. Its recorded 
optimality gap is 15.4% for the lowest SNR scenario and 4.6% for the highest on 
average. 

(v) The worst-case performance for the 2nd heuristic is observed at the lowest SNR and 
highest CBR-loading scenario, where the performance loss compared to the optimal 



scheme is 23.87% (Fig 4(a) -rightmost group of bars). Recall that the particular 
algorithm first allocates the available resources ignoring the demanded QoS levels. 
Under such conditions, the CBR-class sum-rate heavily dominates the overall dual- 
class sum-rate, and thus the subsequent resources reallocation phase finds great 
difficulty in leading to a feasible allocation. In other words when the propagation 
conditions are harsh and the QoS targets demanding, it is better to first guarantee 
the strict CBR rate constraints and then look for an improved allocation. On the 



contrary at higher SNR/lower CBR-loading conditions (Fig 4(b) -leftmost group 
of bars) the achieved performance is quite high (or equivalently the associated 
optimality gap is quite low) since the distance between the original solution and 
the feasibility is significantly smaller. 

(vi) The performance of the 1st heuristic is not significantly affected by the CBR-loading 
conditions, contrary to the 2nd heuristic behavior. For the lowest power scenario 
the optimality gap of the 1st heuristic increases by 2.3% (from 5.9% goes to 8.2%) 
as the number of CBR users increases from 6 to 12, whereas for the 2nd heuristic 
the corresponding increase is 14.3% (from 9.6% goes to 23.9%). This is justified by 
the fact that the 1st heuristic focuses on finding a feasible solution by prioritizing 
CBR-class users at the initial allocation phase. Similar conclusions may be drawn 
for higher SNR scenarios as well. 

(vii) As far as the 1st heuristic the importance of the subchannel swapping step is 
demonstrated through Fig(3] where one may observe the improvement level of the 
sum-rate performance for all the simulation experiments. The performance gain 
varies from 1.2-7.7% and it is more noticeable in lower CBR-loading scenarios. In 
such cases a lower number of subchannels is required for the CBR-class and thus a 
larger number of re-asssignments is expected to occur. 
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(viii) Averaged over all conditions, the 1st heuristic achieves 96.21% of the optimal sum- 
rate performance while the 2nd 91.63% of it. 

(ix) The performance gap between the actual optimal solution (IP) and the upper bound 
(LP) is not negligible. For the lowest power scenario (Fig|4(aj"|| this gap ranges 



2.8% and 11.9%. Therefore, although the LP solution is extracted very efficiently 
compared to the IP, it often fails on providing a tight bound on the exact optimal 
performance. Consequently, if the LP bound is used as a performance benchmark 
for the evaluation of a suboptimal scheme, then the actual efficiency of the latter 
would be underestimated. 



6 Conclusions - Future Work 

The resource allocation problem of maximizing the sum-rate performance for a down- 
link OFDMA single-cell network (like LTE) assuming heterogeneous traffic requests 
and uniform power loading over the system subchannels was studied in this work. The 
problem was first mathematically modeled as an ILP optimization problem, allowing us 
to extract the actual optimal allocation decision and the associated maximum achieved 
sum-rate performance in reasonable execution time. By relaxing the integrality con- 
straints on the above model, a performance upper bound may also be extracted with 
3rd order polynomial complexity cost. In the second part of this work, motivated by 
the resemblance of the specific problem with a well-known combinatorial optimization 
problem that is, the Generalized Assignment, we developed two heuristic algorithms 
for efficiently allocating system subchannels to the competing classes and users. We 
finally demonstrated through extensive simulation experiments that the performance 
loss of the heuristic schemes compared to the optimal is rather low, especially for the 
1st heuristic, and that our schemes heavily outperform semi-random subchannel assign- 
ments. Possible extensions of this work may involve the cooperation of the proposed 
schemes with time-domain scheduling algorithms handling packet delay and fairness 
QoS objectives, the employment of practical channel state reporting schemes and the 
consideration of interference in multi-cell deployments. 

A The Data-Rate Abstraction Model 

Let (n, k) an arbitrary subchannel/user combination and assume that a bit-stream is trans- 
mitted from the BS to the k th user over the n th subchannel. We denote by P n % the allo- 
cated transmitted power, |/i nj fc| 2 the propagation channel power gain (which is known at both 
transceiver ends), Nq the noise power density and Af the bandwidth of each subchannel. 



between 4.4% and 17.9% whereas for higher BS power 
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CBR Load (bits) = # CBR users x 36 bits/user 
Fig. 1 System Rate performance vs CBR-Loading conditions 
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Fig. 2 System Rate performance vs average received SNR conditions 
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Fig. 3 The effect of the subchannels swap procedure to the HEUR1 performance 
Table 2 Achieved Optimality Gap values for all Scenarios 



Scenario 


IP/LP 


HEUR1/IP 


HEUR2/IP 




% 


% 


% 


{6, 2.0} 


95.59 


94.11 


90.36 


{6, 2.5} 


95.99 


96.94 


93.41 


{6, 3.0} 


97.18 


97.46 


94.58 


{6, 3.5} 


96.03 


98.21 


95.89 


{6, 4.0} 


97.29 


96.92 


96.32 


{8, 2.0} 


93.50 


94.45 


86.24 


{8, 2.5} 


93.60 


93.77 


92.16 


{8, 3.0} 


94.11 


96.74 


94.31 


{8, 3.5} 


94.62 


97.39 


95.48 


{8, 4.0} 


94.91 


97.80 


96.23 


{10, 2.0} 


88.55 


92.90 


85.56 


{10, 2.5} 


90.12 


95.61 


90.83 


{10, 3.0} 


91.24 


96.93 


93.28 


{10, 3.5} 


91.94 


97.63 


94.65 


{10, 4.0} 


92.62 


98.11 


95.59 


{12, 2.0} 


82.12 


91.84 


76.12 


{12, 2.5} 


85.54 


94.51 


85.27 


{12, 3.0} 


88.09 


96.44 


90.05 


{12, 3.5} 


88.62 


98.60 


92.78 


{12, 4.0} 


91.11 


97.97 


95.40 



Then, by applying the closed- form approximation model of pQ, the achieved data-rate (chan- 
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the depicted percentage values express 

the performance loss of each heuristic (HEUR1 ,HEUR2) 

from the exact optimal (IP). 
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(a) Lower SNR-conditions 
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the depicted percentage values express 
the performance loss of each heuristic (HEUR1 ,HEUR2) 
from the exact optimal (IP). 
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(b) Higher SNR-conditions 
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Fig. 4 Sum-Rate and Optimality Gap Performance 
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nel capacity) for preserving a minimum transmission error rate of P e will be given by Eq. (|3jl, 
where / stands for the abstraction function. 



n.k i "n,k i 



Pe) = log 2 



1 + 



*"n, k | 



(3) 



We further define the normalized channel-to-noise ratio as in Eq. JIJ, and by employing 
the Uniform Power Loading assumption, we resort to the expression of Eq. J5j where Pf ss is the 
total available BS power. As also seen in Eq. ||5} the achieved data rate on each subchannel 
is hard-limited by the highest available transmission order denoted by c max bits. The latter 
comprises a system constraint similar to the BS power. Finally, notice that if power loading is 
not a-priori known then the achieved data-rates can not be precalculated, since they depend 
on the allocated amount of power. In such scenarios joint power and subchannel allocation 
must be employed (see | 36l for example). 



7n,fe = |Vfc| 2 / [(1/3) ■ [<2 _1 (ft/4)] 2 ■ A^o ■ Af] 



r nk = mini log 2 1 + 



Tn,fc 



(4) 



(5) 
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